
Training the next generation of physicians for artificial intelligence-assisted clinical neuroradiology: ASNR MICCAI Brain Tumor Segmentation (BraTS) 2025 Lighthouse Challenge education platform

Amiruddin, Raisa, Yordanov, Nikolay Y., Maleki, Nazanin, Fehringer, Pascal, Gkampenis, Athanasios, Janas, Anastasia, Krantchev, Kiril, Moawad, Ahmed, Umeh, Fabian, Abosabie, Salma, Abosabie, Sara, Alotaibi, Albara, Ghonim, Mohamed, Ghonim, Mohanad, Mhana, Sedra Abou Ali, Page, Nathan, Jakovljevic, Marko, Sharifi, Yasaman, Bhatia, Prisha, Manteghinejad, Amirreza, Guelen, Melisa, Veronesi, Michael, Hill, Virginia, So, Tiffany, Krycia, Mark, Petrovic, Bojan, Memon, Fatima, Cramer, Justin, Schrickel, Elizabeth, Kosovic, Vilma, Vidal, Lorenna, Thompson, Gerard, Ikuta, Ichiro, Albalooshy, Basimah, Nabavizadeh, Ali, Tahon, Nourel Hoda, Shekdar, Karuna, Bhatia, Aashim, Kirsch, Claudia, D'Anna, Gennaro, Lohmann, Philipp, Nour, Amal Saleh, Myronenko, Andriy, Goldman-Yassen, Adam, Reid, Janet R., Aneja, Sanjay, Bakas, Spyridon, Aboian, Mariam

arXiv.org Artificial Intelligence

High-quality reference-standard image data created by neuroradiology experts for automated clinical tools can be a powerful vehicle for neuroradiology and artificial intelligence education. We developed a multimodal educational approach for students and trainees during the MICCAI Brain Tumor Segmentation (BraTS) Lighthouse Challenge 2025, a landmark initiative to develop accurate brain tumor segmentation algorithms. Fifty-six medical students and radiology trainees volunteered to annotate brain tumor MR images for the BraTS challenges of 2023 and 2024, guided by faculty-led didactics on neuropathology MRI. Among the 56 annotators, 14 selected volunteers were then paired with neuroradiology faculty for guided one-on-one annotation sessions for BraTS 2025. Lectures on neuroanatomy, pathology, and AI, journal clubs, and data-scientist-led workshops were organized online. Annotators and audience members completed surveys on their perceived knowledge before and after the annotations and lectures, respectively. Fourteen coordinators, each paired with a neuroradiologist, completed the data annotation process, averaging 1322.9+/-760.7 hours per dataset per pair and 1200 segmentations in total. On a scale of 1-10, annotation coordinators reported a significant increase in familiarity with image segmentation software from before to after annotation, rising from an initial average of 6+/-2.9 to a final average of 8.9+/-1.1, and a significant increase in familiarity with brain tumor features, rising from an initial average of 6.2+/-2.4 to a final average of 8.1+/-1.2. We demonstrate an innovative offering for providing neuroradiology and AI education through an image segmentation challenge, enhancing understanding of algorithm development, reinforcing the concept of the data reference standard, and diversifying opportunities for AI-driven image analysis among future physicians.


Agent Context Protocols Enhance Collective Inference

Bhardwaj, Devansh, Beniwal, Arjun, Chaudhari, Shreyas, Kalyan, Ashwin, Rajpurohit, Tanmay, Narasimhan, Karthik R., Deshpande, Ameet, Murahari, Vishvak

arXiv.org Artificial Intelligence

AI agents have become increasingly adept at complex tasks such as coding, reasoning, and multimodal understanding. However, building generalist systems requires moving beyond individual agents to collective inference -- a paradigm where multi-agent systems with diverse, task-specialized agents complement one another through structured communication and collaboration. Today, coordination is usually handled with imprecise, ad-hoc natural language, which limits complex interaction and hinders interoperability with domain-specific agents. We introduce Agent Context Protocols (ACPs): a domain- and agent-agnostic family of structured protocols for agent-agent communication, coordination, and error handling. ACPs combine (i) persistent execution blueprints -- explicit dependency graphs that store intermediate agent outputs -- with (ii) standardized message schemas, enabling robust and fault-tolerant multi-agent collective inference. ACP-powered generalist systems reach state-of-the-art performance: 28.3% accuracy on AssistantBench for long-horizon web assistance and best-in-class multimodal technical reports, outperforming commercial AI systems in human evaluation. ACPs are highly modular and extensible, allowing practitioners to build top-tier generalist agents quickly.
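The abstract names two ingredients -- a persistent execution blueprint (a dependency graph that stores intermediate agent outputs) and a standardized message schema -- without specifying them. The sketch below is a minimal hypothetical illustration of how those two ideas compose, not the authors' protocol; the `Message` fields, `Blueprint` class, and error-propagation rule are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    """Hypothetical standardized message schema: sender, task id, payload, status."""
    sender: str
    task: str
    payload: str
    status: str = "ok"


class Blueprint:
    """Persistent execution blueprint: a dependency graph whose nodes
    cache the intermediate outputs of the agents that produced them."""

    def __init__(self):
        self.deps = {}     # task -> list of prerequisite tasks
        self.agents = {}   # task -> callable(inputs: dict[str, Message]) -> Message
        self.outputs = {}  # task -> Message (persisted intermediate output)

    def add(self, task, agent, deps=()):
        self.deps[task] = list(deps)
        self.agents[task] = agent

    def run(self, task):
        # Reuse persisted outputs; recurse into unmet dependencies first.
        if task in self.outputs:
            return self.outputs[task]
        inputs = {d: self.run(d) for d in self.deps[task]}
        if any(m.status == "error" for m in inputs.values()):
            # Fault tolerance: propagate a structured error instead of crashing.
            msg = Message("system", task, "", status="error")
        else:
            msg = self.agents[task](inputs)
        self.outputs[task] = msg
        return msg


# Toy usage: a "searcher" agent feeds a "writer" agent via the blueprint.
bp = Blueprint()
bp.add("search", lambda i: Message("searcher", "search", "found 3 docs"))
bp.add(
    "summarize",
    lambda i: Message("writer", "summarize", "summary of " + i["search"].payload),
    deps=["search"],
)
result = bp.run("summarize")
```

Because every intermediate output is persisted in `bp.outputs`, a downstream agent can be re-run or swapped without re-invoking its upstream dependencies, which is the interoperability benefit the abstract attributes to explicit dependency graphs.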


Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software

Khurana, Anjali, Su, Xiaotian, Wang, April Yi, Chilana, Parmit K

arXiv.org Artificial Intelligence

Large Language Model (LLM)-based in-application assistants, or copilots, can automate software tasks, but users often prefer learning by doing, raising questions about the optimal level of automation for an effective user experience. We investigated two automation paradigms by designing and implementing a fully automated copilot (AutoCopilot) and a semi-automated copilot (GuidedCopilot) that automates trivial steps while offering step-by-step visual guidance. In a user study (N=20) across data analysis and visual design tasks, GuidedCopilot outperformed AutoCopilot in user control, software utility, and learnability, especially for exploratory and creative tasks, while AutoCopilot saved time for simpler visual tasks. A follow-up design exploration (N=10) enhanced GuidedCopilot with task- and state-aware features, including in-context preview clips and adaptive instructions. Our findings highlight the critical role of user control and tailored guidance in designing the next generation of copilots that enhance productivity, support diverse skill levels, and foster deeper software engagement.


Why and When LLM-Based Assistants Can Go Wrong: Investigating the Effectiveness of Prompt-Based Interactions for Software Help-Seeking

Khurana, Anjali, Subramonyam, Hari, Chilana, Parmit K

arXiv.org Artificial Intelligence

Large Language Model (LLM) assistants, such as ChatGPT, have emerged as potential alternatives to search methods for helping users navigate complex, feature-rich software. LLMs use vast training data from domain-specific texts, software manuals, and code repositories to mimic human-like interactions, offering tailored assistance, including step-by-step instructions. In this work, we investigated LLM-generated software guidance through a within-subject experiment with 16 participants and follow-up interviews. We compared a baseline LLM assistant with an LLM optimized for particular software contexts, SoftAIBot, which also offered guidelines for constructing appropriate prompts. We assessed task completion, perceived accuracy, relevance, and trust. Surprisingly, although SoftAIBot outperformed the baseline LLM, our results revealed no significant difference in LLM usage and user perceptions with or without prompt guidelines and the integration of domain context. Most users struggled to understand how the prompt's text related to the LLM's responses and often followed the LLM's suggestions verbatim, even if they were incorrect. This resulted in difficulties when using the LLM's advice for software tasks, leading to low task completion rates. Our detailed analysis also revealed that users remained unaware of inaccuracies in the LLM's responses, indicating a gap between their lack of software expertise and their ability to evaluate the LLM's assistance. With the growing push for designing domain-specific LLM assistants, we emphasize the importance of incorporating explainable, context-aware cues into LLMs to help users understand prompt-based interactions, identify biases, and maximize the utility of LLM assistants.


Analyzing Chain-of-Thought Prompting in Large Language Models via Gradient-based Feature Attributions

Wu, Skyler, Shen, Eric Meng, Badrinath, Charumathi, Ma, Jiaqi, Lakkaraju, Himabindu

arXiv.org Artificial Intelligence

Chain-of-thought (CoT) prompting has been shown to empirically improve the accuracy of large language models (LLMs) on various question answering tasks. Understanding why CoT prompting is effective is crucial to ensuring that this phenomenon is a consequence of desired model behavior, yet little work has addressed this question, even though such an understanding is a critical prerequisite for responsible model deployment. We address this question by leveraging gradient-based feature attribution methods, which produce saliency scores that capture the influence of input tokens on model output. Specifically, we probe several open-source LLMs to investigate whether CoT prompting affects the relative importance they assign to particular input tokens. Our results indicate that while CoT prompting does not increase the magnitude of saliency scores attributed to semantically relevant tokens in the prompt compared to standard few-shot prompting, it increases the robustness of saliency scores to question perturbations and variations in model output.
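The core mechanism here -- gradient-based feature attribution -- assigns each input token a saliency score derived from the gradient of the model's output with respect to that token's embedding. The paper applies this to open-source LLMs via autodiff; the toy below is a stand-in (not the authors' setup) that uses a linear "model" so the gradient is analytic, illustrating the common input-times-gradient attribution. The weight vector `w` and inputs are invented for the example.

```python
import numpy as np

# Toy differentiable "model": score(x) = w . x, a stand-in for an LLM
# logit as a function of its input token embeddings (one scalar per token).
w = np.array([0.5, -2.0, 0.1, 1.5])


def score(x):
    return float(w @ x)


def input_x_gradient_saliency(x):
    # For this linear model, d(score)/dx is exactly w; a real LLM would
    # obtain the gradient via autodiff. The input-times-gradient
    # attribution is then |x * grad|, one saliency score per "token".
    grad = w
    return np.abs(x * grad)


x = np.array([1.0, 1.0, 1.0, 1.0])  # four equally-sized "token" inputs
sal = input_x_gradient_saliency(x)
```

Here the second position dominates the saliency ranking purely because of its weight magnitude; the paper's question is whether CoT prompting changes such rankings (it largely does not) or their stability under perturbation (it does).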


Large Language Models Based Automatic Synthesis of Software Specifications

Mandal, Shantanu, Chethan, Adhrik, Janfaza, Vahid, Mahmud, S M Farabi, Anderson, Todd A, Turek, Javier, Tithi, Jesmin Jahan, Muzahid, Abdullah

arXiv.org Artificial Intelligence

Software configurations play a crucial role in determining the behavior of software systems. In order to ensure safe and error-free operation, it is necessary to identify the correct configurations, along with their valid bounds and rules, which are commonly referred to as software specifications. As software systems grow in complexity and scale, the number of configurations and associated specifications required to ensure correct operation can become large and prohibitively difficult to manage manually. Due to the fast pace of software development, correct software specifications are often not thoroughly checked or validated within the software itself. Rather, they are frequently discussed and documented in a variety of external sources, including software manuals, code comments, and online discussion forums. As a result, it is hard for system administrators to know the correct specifications of configurations, given the lack of clarity, organization, and a centralized, unified source to consult. To address this challenge, we propose SpecSyn, a framework that leverages a state-of-the-art large language model to automatically synthesize software specifications from natural language sources. Our approach formulates software specification synthesis as a sequence-to-sequence learning problem and investigates the extraction of specifications from large contextual texts. This is the first work that uses a large language model for end-to-end specification synthesis from natural language texts. Empirical results demonstrate that our system outperforms the prior state-of-the-art specification synthesis tool by 21% in terms of F1 score and can find specifications from single as well as multiple sentences.


Can AI answer your money questions? We put chatbots to the test

#artificialintelligence

NEW YORK, April 13 (Reuters) - Face it, we could all use a little help with our money. So who better to ask for personal finance advice than a couple of the most powerful chatbots on the planet? Both OpenAI's ChatGPT and Google's Bard are dominating headlines recently, for their generative capabilities and vast storehouses of information. Each has far more processing power than, say, any individual personal finance writer (ahem). What is one great business idea?


How AI Could Change the Highly-Skilled Job Market

#artificialintelligence

When most people think of the connection between technology and jobs, they think of robots and automation taking over relatively unskilled jobs like factory work. And thus, the biggest toll from these technological advances would be on already hard-hit manufacturing regions of the Rust Belt. But a new wave of developments in artificial intelligence may have a greater effect on high-skilled jobs and high-tech knowledge regions. The study by Mark Muro, Jacob Whiton, and Robert Maxim takes a close look at the potential of artificial intelligence--or AI--to automate tasks that until now have required human intelligence and decision-making. As they put it: "Unlike robotics (associated with the factory floor) and computers (associated with routine office activities), AI has a distinctly white-collar bent."


Automation and AI sound similar, but may have vastly different impacts on the future of work

#artificialintelligence

Last November, Brookings published a report on artificial intelligence's impact on the workplace that immediately raised eyebrows. Many readers, journalists, and even experts were perplexed by the report's primary finding: that, for the most part, it is better-paid, better-educated white-collar workers who are most exposed to AI's potential economic disruption. This conclusion--by authors Mark Muro, Robert Maxim, and Jacob Whiton--seemed to fly in the face of the popular understanding of technology's future effects on workers. For years, we've been hearing about how these advancements will force mainly blue-collar, lower-income workers out of jobs, as robotics and technology slowly consume those industries. In an article about the November report, The Mercury News outlined this discrepancy: "The study released Wednesday by the Brookings Institution seems to contradict findings from previous studies--including Brookings' own--that showed lower-skilled workers will be most affected by robots and automation, which can involve AI."


Feature and TV films

Los Angeles Times

Mr. Smith Goes to Washington 1939 TCM Tue. 7 p.m. Mean Streets 1973 Cinemax Sun. 6 a.m. Batman Begins 2005 AMC Sun. Throw Momma From the Train 1987 EPIX Sun. Die Hard 1988 IFC Sun. I Know What You Did Last Summer 1997 Starz Tue. Gone in 60 Seconds 2000 CMT Wed. 8 p.m., Thur. Total Recall 1990 Encore Thur. 2 a.m. A Fish Called Wanda 1988 Encore Thur. 2 p.m., 9 p.m. The World Is Not Enough 1999 EPIX Sat. 4 p.m. Look Who's Talking 1989 OVA Sun. Die Hard With a Vengeance 1995 IFC Thur. Oil-platform workers, including an estranged couple, and a Navy SEAL make a startling deep-sea discovery. A clueless politician falls in love with a waitress whose erratic behavior is caused by a nail stuck in her head. After glimpsing his future, an ambitious politician battles the agents of Fate itself to be with the woman he loves. To help a friend, a suburban baby sitter drives into downtown Chicago with her two charges and a neighbor. Two teenage baby sitters and a group of children spend a wild night ...